Evaluation of DFTDS algorithm for distributed data warehouse
نویسندگان
چکیده
منابع مشابه
Interoperable Distributed Data Warehouse Components
Extraction, Transformation and Loading (ETL) are the major functionalities in data warehouse (DW) solutions. Lack of component distribution and interoperability is a gap that leads to many problems in the ETL domain, because these ETL components are tightly-coupled in the current ETL framework. Furthermore, complexity of components extensibility is another gap in the ETL area, because of the sa...
متن کاملEntropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملExperimental Evaluation of Data Warehouse Configuration Algorithms
A Data Warehouse (DW) can be seen as a set of materialized views defined over relations that are stored in remote heterogeneous database systems. When a query is posed to the DW, it is evaluated locally, using only the materialized views. The DW configuration problem is the problem of selecting an optimal set of views to materialize that answer a given set of queries. The objective is the minim...
متن کاملBrown Dwarf: A Distributed Data Warehouse for the Cloud
In this paper we present the Brown Dwarf, a distributed system designed to efficiently store, query and update multidimensional data over commodity network nodes, without the use of any proprietary tool. Brown Dwarf manages to distribute a highly effective centralized structure among peers on-the-fly, reducing cube creation and query times by enforcing parallelization. Both point and aggregate ...
متن کاملEfficiency evaluation of data warehouse operations
We present an efficiency model for data warehouse operations and analyze a data set to evaluate the model. The model contains salient variables to evaluate the efficiency of an organization’s data warehouse operations for refresh processing and query production. The variables in the model include resource consumption (labor usage and computing budgets), system usage measures (users and queries)...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Egyptian Informatics Journal
سال: 2014
ISSN: 1110-8665
DOI: 10.1016/j.eij.2013.10.002